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It has long been a puzzle how to solve random multiplicative cascade structures 
analytically. We present an analytical solution found recently in the form of a 
simple pedagogical example of the general case. 



This contribution is intended to serve as an introduction-by-example for 
prospective students and the casual observer. For more detailsuand greater 
rigour, the interested reader should consult the published paperstlm 

What are multiplicative random cascade models (MRCM's), and why are 
they relevant? Turbulence in fluids must necessarily obey the Navier-Stokes 
equation, while the QCD lagrangian in principle describes all high-energy in- 
teractions and final states. Nevertheless, many experimental results cannot be 
reproduced from the underlying theories, even though they are known. We 
hence look to models to reproduce essential features of physical data and 
hence point the way. This approach has a long and fruitful historyu 

Multiplicative variables do not obey classical central limit theorems but 
rather exhibit large fluctuations even in the limit of large numbers. Large 
fluctuations, unexplainable in terms of additive variables, are seen in the energy 
dissipation e in fully developed turbulence and in the number and density of 
pions in the final states of high-energy hadronic collisions. 

The models contain random variables because distributions of relevant 
quantities differ greatly from event to event in high energy physics and corre- 
spondingly in different spatial "snapshots" or pieces of time series of e. 

Hierarchical structures or cascades are used because they can easily be 
made scale invariant and are hence well placed to mimick the scale invari- 
ance seen in the Richardson cascades, the original Kolmogorov theory, as well 
as some deviations from it. In high-energy e + e _ , lepton-hadron and hadron- 
hadron collisions, particle production can be described at least in part as cas- 
cades of partons and hadrons. 
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A typical simple MRCM goes as follows. Start with an initial interval of 
unit length, on which a total "energy dissipation" e is measured^]. Divide the 
interval into two equal pieces of length i and split e randomly into a "left" 
piece e and a "right" piece e-y. These are calculated from the "parent" e by 

e = q e, t\ = qyt , (1) 

where the multiplicative random variables go and 9i are found by rolling the 
dice, the probability of obtaining a particular value for q$ and q± being deter- 
mined by the model's "splitting function" p(q<a, q\). For example, the so-called 
p-model has the splitting function 

p(qo, <7l) = § [S(qo - (1 + /?)) + S(q Q - (1 - /?))] S(q + qi-2) 1 (2) 

where (3 is an adjustable parameter determining how strong the fluctuations 
will be. In other words, go has an equal chance of being 1+/3 or 1— /?, with qi 
automatically taking the other option.^ In the next step, the left "0" length 
interval is again split into "left" and "right" subintervals, and the energy den- 
sities are given by 

eoo = <7oo£o = <7oo<7oe and e i = <7oieo = goigoe , (3) 

where the q's are again determined by throwing dice weighted by p(qoo,qoi)- 
The same kind of thing happens in the "1" interval also, so that we have 
four subintervals with four energy densities eoo, eoi, eio and en. This can be 
continued for as long as we like: after J such cascade steps we end up with 2 J 
intervals with one e per interval, characterised by some binary address. 

The correspondence between these models and data is established by pre- 
tending that the measured e's in an experimental data set result directly from 
such a cascade (of course this is a big simplification), and then to compare 
correlations between e's from different bins to those given by the model. In 
this way, one can see whether such models reproduce the correlation data. 

The problem is that so far it has been impossible to find systematically 
analytic expressions for correlations in these models; only a few special cases 
were solved. Here, we show in a very simple example how, if you do everything 
in the logarithms of the e's, then you can get analytical answers easily. 

Assume that we have a cascade of just two steps, J = 2, so that we have 
four intervals. The tree for our little example cascade is shown in Figure 1. 



b The variable e can correspond to different physical quantities for different systems; for our 
purposes it is important only that the e's must be positive scalars. 

c The delta function 6(qo + qi — 2) ensures that qo and q\ add up to 2 because eg an d e i are 
energy densities so that the left and right energies Eq = eo • (j) and Ei = ei ■ (^) add 
up to E = e/1. Hence we call this model "energy-conserving". 



2 



p(qo qi ) 




p(qoo qoi ) 
qoo / \ qoi 











lne 00 


lne M 


lne 10 


lne u 



p(lnE(Ki lne 01 lne 10 lne ) 

Figure 1: Branches, splitting functions p, multipliers q and (logs of) final 
energy densities In e for a two-step multiplicative cascade. 



The generating function for these four bins, 
Zt{^oo, Xqi, Aio, An) 



= /g(Aoo lneoo + Aoi lneni+Ain lnein+An lneu) 



(4) 



contains all possible information on the model, because any correlation can be 
found from it (in the form of moments and cumulants) by taking derivatives 
of Zt- For example, the "moment between bin 00 and 10" is given by 



(lne 00 lneio> = Poo.io 



d 2 Z T 



Ann— An i — Am— An— 



(5) 



and the corresponding cumulant is found from (lneoohieio) — (In eoo) (In eio) = 
Coo, 10 = (d 2 \nZT/d\oodXio) | - Higher orders are calculated in the same way. 
Hence, if we know Zt analytically, all correlations are known analytically too. 

We now show that Zt factorizes by first writing out the average ( ) in Eq. 
(0) in terms of the joint distribution function p(\n eoo, In eo%, In eio, In en), 



Zt = J dlneoodlneoidlneiodlneii p(lne 00 , hie i, mei , hien) 



xe 



(Ann lneno+Ani lneni+Aio lnein+An lneu) 



(G) 



Since e&; = qkiqkt, the logs are additive, lne^ = lnq^ + lnqx for all k, I. (As 
the initial energy density is a constant, we can set it to 1 so that lne = 0.) We 
can hence write p(lneoo, hieoi, lneio, lnen) m terms of the splitting functions 
by fixing the e^/'s to the appropriate q's: 

p(lneoo, ■ • • ,lnen) = J dq dq x dqo dq 01 dq w dq xl p(q , q x ) p(q o, qoi) p(<?io, qn) 

x5[ln e 00 - In q 0Q - In q ] 5 [In e 01 - In q 01 - In g ] 
x5[lneio-lngi -lngi] J [In en- lnq n - In q x ] . (7) 
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Inserting this into Eq. (g), we see that the four delta functions kill the dliaeki 
integrals, so that we are left with 



dq dqip{q a ,qi) J dq 00 dq 01 p(q 00 , q 01 ) J dq 1Q dq xl p{q w , qu) 

xe [ A oo(lii9oo+ln<?o)+- ) >oi(ln(?oi+lii'Jo)+^lo(ln(Jio+liigi)+Aii(lnijii+ln(;i)] ^g-j 

which can be written in a factorised form: 

MAoo.Aor.Aro.Aii) - / dg ^ip(-?o, 9i) e [(Aoo+Aoi)ln9o+(Alo+All)ln911 
X j dqovdq olP (q o,qoi)e [Xoolnqaa+Xo1 ln9ol] 
X I dq w dq 11 p{q w ,q 11 )e^ R ^ +X ^ a ^ , (9) 



in other words, 

Zt(^oo, Aqi, Aio, An) = Z(\q, Ai) Z(Xoo, Aoi) ^(Aio, An) , (10) 

where we have defined Ao = Aoo + Aoi and Ai = Aio + An. Each of the three 
generating functions on the right is the Laplace transform of the corresponding 
splitting function; for example 

^(Aoo,A i) = /d<Zood<Zoip(<Zoo,goi)e [Aooln * 0+Ao1 lnqol] . (11) 



The importance of Eq. ( |Io|) is that it makes explicit the statistical independence 
of the different branchings. This we knew, of course, because each branching 
had its own independent splitting function p, but we could not previously make 
this independence explicit in the generating functions. 

The independence becomes even more marked when writing (10) in terms 
of the cumulant branching generating functions Q = In Z: the total cumulant 
generating function becomes a simple sum of cumulant branching generating 
functions (CBGF), 

In Z t (\q , A i, Ai , An) = Q(A , Ai) + Q(A 00 , A i) + Q(Ai , An) . (12) 

The above derivation can be generalised to any number of cascade steps: for 
J cascade steps leading to 2 J bins and A's, the overall cumulant generating 
function is the sum of CBGF's, one for each branching: 

,/ l 

lnZ T = '^2 2^ Q(Afc 1 fc 2 ...fc 3 _ 1 o, Afe 1 fe 2 ...fe i _ 1 i) , (13) 



where the indices k\k2 • ■ ■ kj are just the binary addresses of the different bins 
as before. Also, just as above we had Ao = Aoo + Aoi and Ai = Aio + An, there 
is a "tree of parameters" 



\fei--fej 



E 



A 



(14) 



The tree structures of (|13|) and ( |14| ) are 
shown in Figures 2 and 3. The hier- 
archy of Q's shows how each one gov- 
erns the behaviour of the local branch- 
ing and how the Q's sum to In Zt- Each 
A shown in Figure 3 is the sum of all the 
A's in the tree below it. 

Let us take a concrete example 
to show what these hierarchies imply. 
To find the second cumulant between 
the bins "101" and "110", we note that 
{d\n Zt / d\\m) contains contributions 
from the branchings involving Ai, Aio 
and Aioi because these all "contain" the 
relevant Aioi through the sum hierar- 
chy of Fig. 3. These contributions are 
shown by the circled branching points 
in Fig. 4a. Similarly, the derivative 
(91nZr/i9Aiio) contains contributions 
as shown in Fig. 4b. Since the lowest 
circles in Fig. 4a and 4b respectively are 
completely uncorrelated (their Q's are 
independent, after all), it is thus plau- 
sible that only those branching points 
common to both (shown in Fig. 4c) 
will contribute to the second deriva- 
tive (d 2 lnZT/dAioi 9 Ano); indeed this 
is so. Hence the relevant cumulant can 
simply be read off as 




In Zx 

Figure 2: Cumulant branch- 
ing generating functions Q, 
one at each branching point. 
The overall generating func- 
tion In Zt is the sum of all Q's. 



^ioo ^-ioi ^ i io ^<iii 
/\ /\ /\ /\ 

Figure 3: Each A is the sum of 
all A's in the tree below it. 



c 



101,110 



d 2 In Z T 
dXxoi d\ 



no 



d 2 Q(A ,Ai) 
dX 2 



d 2 Q(Xio,Xn) 
d\ 10 dX n 



(15) 



5 






Figure 4: Example of calculating the cumulant Cioi.no from "same- 
lineage" and "splitting" cumulant pieces which lie along the common 
line of ancestry of the two addresses 101 and 110. 



Actually, things become even simpler. Since each Q has only two parameters, 
there are two basic kinds of branching cumulants, namely "samc-lincage" cu- 
mulants, involving only derivatives with respect to one of the two A's, and 
"splitting" cumulants involving both the left and right A. An arbitrary multi- 
variate cumulant will then obey the principle of common ancestry: 

A cumulant between n bins is the sum of 
all same-lineage branching cumulants at branch- 
ing points which are ancestors of ALL the relevant 
bins. The moment the first splitting in daughters 
occurs, the splitting cumulant of this branching 
point is added, and then the process stops. In 
other words, once at least one bin has "separated" 
into a different lineage, the last common branch- 
ing point contributes and nothing after that. 

This is expressed elegantly in mathemati- 
cal form with the help of the so-called ultametric 
distance d 2 , shown in Figure 5, which is simply 
a count of how many generations one must move 
up before a common ancestor is found. 

The ultrametric distance closely mimicks family terminology, e.g. siblings 
have ^2 = 1 and cousins d 2 = 2. For any bin addresses Ki, k 2 and n 3 , the 
cumulants can be expressed generically as d 




Figure 5: The ultrametric 
distance e?2 between bins. 



d\nZ T 



8X 



Ki 



(16) 



o 



d The assumption made here is that the splitting functions are the same at all branchings. 
This is not necessary, of course, but the formulae would become more complicated. 
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+ (1-^2,0) 



d 2 Q 



(17) 



The second term in ( |17| ) represents the splitting cumulant contribution; the 
(1— 5d 2 fl) makes sure that this is not counted when K\ and K2 are identical. The 
first term counts all the same-lineage branching cumulants; the prefactor (J — 
^2) represents the number of common ancestors. The same can be done in third 
order: defining the ultrametric distance as d% = max [^(ki, K2), ^2(^2, ^3), 
^2(^3, Ki)], the cumulant between any three bins is given by 



(J - da) 



d 3 Q 



+ (1 - 6 d3 fi) 



d 3 Q 



d 2 X L dX R 



(18) 



with a similar interpretation. Fourth order works similarly. 

Let us use the p-model as an example again. Insert Eq. (£ 
tranform (O) and take the logarithm to get 



into the Laplace 



Q(Ao,Ai) = i(A +A x ) ln(l-/3 2 )+lncosh 



KAo-AOln^^l 



(19) 



The a-model, characterised by p(qo, qi) = \ [6(qo — (1 + /?)) + S(q — (1 — (3))] 
x [S(q\ — (1 + /?)) + <5(<7i — (1 — (3))] resembles the p-model, except that now 
the go and qi can take on the values (1 + 0) and (1 — /?) independently. Its 
CBGF is found to be 



Q[A ,Ai] = ^(Ao + A^Ml-/? 2 ) 

'1 + 



(20) 



In cosh 



±A In 



1-/3 



In cosh 



hXt In 



1 + 
1-/3 



clearly resembling that of the p- model, but also clearly different. The law 
of common ancestry for these two models is illustrated nicely in the plot in 
Figure 6a of the second order cumulant C^, 1 ^, 2 against ultrametric distance cfe 
between bins Ki and K2, and in Figure 6b for the fourth order C/t ll K 2j K 3l /«4 
(note that some indices are set equal). Given J = 6, Eq. ( jl7j ) shows that for 
c?2 = 6 only the splitting cumulant contributes: it is negative for the p-model 
and zero for the a-model.^] Every time d.2 decreases by one, a constant (the 
same-lineage cumulant) is added, until da = 1- At c?2 = 0, the (negative) 
splitting cumulant does not contribute and so there is a jump in Ck ± ,k 2 f° r 
the p-model. Similar arguments apply in fourth order. 



This must naturally be so since go and q\ are independent in the a-model. 
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We should briefly mention some of 
the consequences of this analytical solu- 
tion. One important connection is that 
Jjamultifractals, which we show in Refs. 
Era to be equivalent to setting one of 
the two variables in Q(Ao,Ai) to zero. 
Hence we contend that our formalism 
can "see more" than multifractals can: 
indeed, the latter cannot distinguish be- 
tween the a- and p-models, while Fig- 
ure 6 shows that we can. 

In retrospect, it is obvious why the 
logarithm prescription works: by tak- 
ing the logarithm, the previously mul- 
tiplicative process becomes an additive 
one, which is easily handled and solved. 
Previously, conversion from multiplica- 
tive to additive variables could be done 
only by assuming scaling and then us- 
ing scaling exponentsp while the change 
in variable from e to In e does not rely 
on scaling assumptions at all. If there 
is any reason to suspect that multiplica- 
tive structures are at work, it is advis- 
able to work in logarithmic quantities. 



*• 1 

o 




Figure 6: Cumulants as 
the ultrametric distances. 



a function of 
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